Scaling issues in ensemble implementations of the Deutsch-Jozsa algorithm 
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We discuss the ensemble version of the Deutsch-Jozsa (DJ) algorithm which attempts to provide 
a "scalable" implementation on an expectation-value NMR quantum computer. We show that 
this ensemble implementation of the DJ algorithm is at best as efficient as the classical random 
algorithm. As soon as any attempt is made to classify all possible functions with certainty, the 
implementation requires an exponentially large number of molecules. The discrepancies arise out of 
the interpretation of mixed state density matrices. 
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Conventional NMR implementations of quantum com- 
puting algorithms require the preparation of pscudopurc 
states lli IE ^ 1^. There have been a few propos- 
als to efficiently implement the Deutsch Jozsa (DJ) algo- 
rithm on an NMR quantum information processor using 
highly mixed states 0, 0] ■ The basic idea in these en- 
semble schemes is to avoid pseudopure state preparation 
which would require exponential resources and instead 
work with highly mixed states close to thermal equilib- 
rium. These schemes need to be carefuly examined for 
their 'quantum character' and their efficiencies compared 
to classical random algorithms. In this paper we show 
that for the DJ problem, a parallel can be drawn between 
these ensemble implementations and classical random al- 
gorithms. 

We begin with a brief recapitulation of the DJ problem. 
Consider the set of functions / : {0, 1}" — > {0, 1}. If all 
2" inputs map to the same output then the function is 
'constant' and if half the outputs map to while the other 
half to 1 then the function is 'balanced'. Functions which 
are neither 'constant' nor 'balanced' are not considered 
here. The task here is to determine the constant or bal- 
anced nature of a given function. Given an oracle which 
evaluates f{x) at an input x G {0,1}", no determinis- 
tic classical algorithm can carry out such a classification 
with certainty without using at least 2"~^ + l invocations 
of the oracle. The quantum DJ algorithm on the other 
hand accomplishes the classification task by invoking the 
oracle only once iS, 9] . This it does by using a quantum 
oracle defined through the unitary transformation on an 
n qubit argument \x) and a one qubit target \y), 
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If a query to the oracle is assumed to come at a unit cost 
then the quantum DJ algorithm provides an exponential 
speedup over its classical counterpart. The n + I qubits 
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need to be in a pure quantum state for the algorithm 
to work. In liquid state NMR at room temperatures, 
which is the most successful implementation of quantum 
information processing to date, the quantum states of the 
spins are far from pure. Therefore, to emulate the stan- 
dard version of the DJ algorithm, one has to prepare the 
system in a special 'pseudopure' state where the ensem- 
ble is divided into two parts: a small subset in a given 
pure state, and the rest acting as a uniform background 
with no contribution to the signal. However, the present 
preparation schemes for such states lead to an exponen- 
tial loss of signal because the subset of spins which can 
be prepared in a 'pure' state decreases exponentially with 
the number of qubits 0, 0| . 

Recently, alternative schemes have been proposed to 
circumvent this difficulty 0, Q • These effectively use a 
computer with n + 1 qubits, where the first n qubits are 
represented by a density matrix i/ (a fully mixed state) 
and the last qubit is in the pure state |0). The initial 
state thus is 
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and rewriting it in the computational basis {|a;)|0) \x = 
- • - a; = 2" - 1} gives 



2"-l 
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This preparation is followed by the standard quantum 
oracle query described in Eq. yielding 
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Before we extract the constant or balanced nature of the 
function we note that this is not an entangled state. The 
entanglement is missing because of the special choice of 
the initial state. As a matter of fact the oracle is ca- 
pable of generating entanglement and the standard pure 
state version of the quantum algorithm relies on entan- 
glement lEIll- 

The information about the function is contained en- 
tirely in the target qubit whose reduced density matrix 
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The expectation value of cr^ in this state will immediately 
reveal the 'constant' or 'balanced' nature of the function 
/, with the result: 



±1 Constant / 
Balanced / 
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By actually carrying out such a measurement the func- 
tion can be classified with a single invocation of the quan- 
tum oracle without the associated problems of preparing 
pure or pseudo-pure states. This scheme is particularly 
suitable for implementation on an NMR quantum infor- 
mation processor where the thermal equilibrium state can 
be easily transformed into the maximally mixed state of 
Eq. |j2Il and expectation value measurements are natural. 

For comparison we describe a classical scenario which 
in essence mimics the 'quantum' scheme described above. 
Instead of the NMR qubits, consider classical bit strings 
of length n -I- 1. Further, assume that we have 2" such 
strings and each string is in a different state for the first 
n bits, thereby providing representation to all possible 
states of the first n-bits. The (n + l)th bit in each string 
is set to '0' and acts as the target bit. Now the application 
of the classical oracle (x — > f(x) with f[x) appearing on 
the target) to all the copies will yield the function values 
at all the 2" input points and this value will be stored 
in the target bit in each copy. The constant or balanced 
nature of the function can then be obtained by adding 
these values by appropriate gates. The values add to 2" 
or for a constant function and to 2"/2 for a balanced 
function. This is analogous to the case of the expectation 
value quantum algorithm using maximally mixed states. 
In our view this scheme is thus fully classical, using sepa- 
rable states at all stages and camouflaged in the language 
of quantum mechanics. The exponential resource is ex- 
plicit in the classical situation while it is hidden in the 
definition of the input density matrix of Eq. ^ for the 
ensemble quantum case. The deceptively simple fact that 
one has effectively prepared the state of the first n qubits 
in the density matrix ^ {I ® I ® I ... ® I) requires that 
we have at least 2" molecules! 

Every density operator can be viewed as an ensemble 
of pure states occurring with certain probabilities. The 
existence of a decomposition 



P 



implies an ensemble {pi, IV'i)} for p, where the state \il>i) 
occurs with probability pi. It is to be noted that the 
states need not be orthogonal and the decomposi- 
tion given above is not unique for mixed states. However, 
whatever can be determined from p can be consistently 
thought of as deriving from any one of the ensembles. 



The ensemble scheme that culminates in Eq. ^ is illus- 
trated by considering the situation in which each ensem- 
ble member is taken to be in one of the computational ba- 
sis (pure) states, {|2:)|0) | x = . . . 2" — 1}, each of which 
occurs with probability px ~ 1/2". This corresponds to 
the input density operator of Eq. Now view the 

scheme as it is applied on each member of the ensemble, 
which means that each member of the ensemble indepen- 
dently computes the function on the input state of that 
member and the output appears on the (n-|- l)th qubit of 
each member molecule. The measurement of the average 
of then ostensibly reveals the constant or balanced 
nature of the function as described in Eq. 

Now imagine that one chooses to work with a fixed 
number of molecules M . As the number of qubits n in- 
creases, soon one will reach a stage when M < 2"/2. In 
this case more than half the computational basis states 
cannot find representation in the ensemble because there 
are simply not enough molecules! Therefore there are al- 
ways balanced functions which will have the same value 
over these M states and will get classified as constant, de- 
spite assuming the best situation, where all the molecules 
are assumed to be in different states. We will see later 
that the actual scheme is even more inefficient because all 
the molecules cannot be in different computational basis 
states. Thus, for the scheme to work for all functions one 
needs at least 2"/2 molecules in the ensemble, a number 
which grows exponentially with n. In other words, when 
the number of molecules is smaller than 2", there is no 
way one can prepare the input density matrix of Eq. ^ . 
We will return to this point later. 

How many balanced functions escape classification for 
a given Ml Assume = 2" is the input set size. The 
number of constant functions is 2 which is independent of 
n while the number of balanced function is If the 

scheme is used with a number of molecules 1 < M < 
the balanced functions which escape classification are the 
ones which have same value (0 or 1) for the first M inputs. 
The number of functions which have the value 1 (0) for 
the first Al inputs is the same as the number of ways 
one can distribute the remaining A^/2 — M I's (O's) on 
A^ — M inputs, giving: 



No. of Balanced Ens. 
Classified as Constant 



= 2 



N -M 
N/2 - M 



(8) 



Dividing this by the total number of balanced functions 
gives the fraction of balanced functions for which the 
schemes fails 



Failure Fraction = 2 



N -M 



N/2 -Mj' \N/2 



N 
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This fraction diminishes quite fast as M increases from 
toward increasing the efficiency of the algorithm. 
The fact that for most cases the scheme will work with 
a relatively small number of molecules has nothing to 
do with quantum mechanics. The classical randomized 
algorithm too will work to the same extent. In fact the 
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above counting is valid for the classical algorithm as well. 
It is well known that there is an efficient randomized 
classical algorithm for the DJ problem and we conclude 
that the expectation value ensemble scheme in the best 
case is equivalent to it p^ . 

Even if we use a molecule number M > 2" how safely 
can we say that the initial maximally mixed state has 
been realized and we are able to classify all functions with 
certainty? It is possible that, in a given experimental im- 
plementation of the algorithm, all ensemble members are 
in the same initial state |a;')|0). In this event, the algo- 
rithm only evaluates f{x') and the measurement outcome 
will be that for a constant function regardless of the na- 
ture of /. Thus, in contrast to the conventional Deutsch- 
Jozsa algorithm, there can be no way of determining the 
function type with certainty. In this sense, this ensem- 
ble algorithm for solving the Deutsch-Jozsa problem is 
not deterministic and must be compared to probabilistic 
classical algorithms. We shall consider the probability 
with which each correctly determines the function type 
and show that, regardless of the ensemble size, the stan- 
dard probabilistic classical algorithm is superior to this 
ensemble quantum algorithm. 

Suppose that the ensemble consists of M identical, in- 
dependent n + 1 qubit molecules. Each member of the 
ensemble will be subject to the unitary of Eq. which 
can be re-expressed as 

^/ = A)(/)«'/ + Pi(/)®a, (10) 

where 

Poif) 

x:f(,x)=0 

A(/) := E (11) 

x:f{x) = l 

project onto subspaces of n qubit argument while / and 
ax act on the target qubit. For the target qubit, (cr^) is 
approximated by 

M 

^-mE^. (12) 

where Zj = ±1 are the outcomes of projective measure- 
ment (zj = 1 corresponding to |0)(0| and Zj = —1 to 
|1)(1|) on the target qubit for individual ensemble mem- 
bers. 

The only assumption that we make about the ensemble 
members' initial states is that they occur with probabil- 
ities described by the density operator of Eq. Q . Then, 
for any ensemble member j, 

Prob(z, = +1 I /) = Trarg (Po(/)/Oin) (13a) 

Prob(z, = -1 I /) = Tra,g (A(/)pin) (13b) 

where measurements are performed immediately after al- 
gorithm unitaries and the traces are taken over the ar- 
gument register only. Note that each constant function 



yields one measurement outcome with certainty: Zj = +1 
for / = and Zj = -1 for / = 1. Thus z = +1 for / = 
and z — ~1 for / = 0. Whenever z departs from ±1 it 
is clear that / is balanced. However, the extent to which 
such a departure is noticeable depends on the available 
measurement resolution, which can be expressed in terms 
of outcomes of function register measurements on indi- 
vidual ensemble members. Suppose that it is possible to 
distinguish two ensemble averages only when they differ 
in R (out of M) or more individual measurement out- 
comes. Then we regard two ensemble averages as dis- 
tinct provided that |z — z'| > R/M. This motivates the 
following protocol for deciding the algorithm outcome: 

z>l- R/M ^ / (14a) 
z <-! + R/M ^ / = 1 (14b) 
1 - R/M > z > -1 + R/M => f balanced. (14c) 

The issue is to determine the probability with which this 
protocol will correctly identify the function type. Con- 
stant functions will always be identified correctly and we 
need only to find the probability that a balanced function 
will give z>l- R/M otz<-1 + R/M. These are the 
probabilities that a balanced function will return zj = —1 
or Zj = -|-1 at most R ~ I times respectively. For a bal- 
anced function Prob(zj = -|-1) = Prob(zj = —1) = 1/2. 
The probability that we incorrectly declare a balanced 
function to be constant is 



Pfail = 




In the best conceivable case R = 1, giving pfaii = 1/2^-'^ ^. 

To account for the spatial resources offered by the en- 
semble we consider the application of C/y to the ensem- 
ble containing M members as equivalent to M oracle 
calls. We must then compare this ensemble algorithm to 
a classical random algorithm that uses M oracle calls. 
In the classical random algorithm one begins by choos- 
ing xi randomly and evaluating f{xi). The next step 
is to choose X2 7^ xi, evaluate /(X2) and compare the 
result to f{xi). If the two differ then / is balanced. If 
not pick x^ which differs from both xi and X2, and com- 
pare /(xa) to f{x2) and f{xi), etc.... The algorithm ter- 
minates when / returns different outcomes or has been 
evaluated on 2"/2 + 1 different inputs. The classical ran- 
dom algorithm never niisidentifies a constant function 
and identifies a balanced function / as constant only 
when f{xi) — f{x2) — ■■■ — /{xm)- The probability 
of failure is the probability with which this occurs. The 
outcome f{xi) = occurs with probability {N/2)/N. 
Given f{xi) = 0, f{x2) = occurs with probability 
{N/2 - 1)/(A^ - 1). Continuing, the probabiUty that 
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xi, . . . , Xm are all such that f{xk) = is: 

_ N/2 N/2 - 1 N/2 -M+1 

^classical " TV - 1 ' ' ' TV - Af + 1 

Similarly the probability ^classical ^^slI xi, . . . , xm are all 
such that f{xk) = 1 can be computed and it turns out to 
be same as ^classical- Thus the probability of failure is: 

As expected, this turns out to be same as the failure frac- 
tion given in Eq. ^ for the quantum ensemble version 
with the unrealistic assumption that every molecule is in 
a different computational basis state. However, note that 
for iV/2 > k>0, {N/2-k)/{N-k) < 1/2, which implies 
that 

^classical < 3 j < ^^.^^^^ (^g) 

Thus the probability of failure for the classical random 
algorithm is strictly less than that of the ensemble quan- 
tum version discussed here. 

The central issue is therefore one of interpreting den- 
sity matrices. What is relevant here are the inferences 
that can be drawn from measurement outcomes on quan- 
tum systems whose states are described by density ma- 
trices. In all cases the density matrix merely provides 
the probability distribution for outcomes of various mea- 
surements. The accuracy with which such a distribution 
is realized improves with an increasingly large ensem- 
ble. Imagine a single quantum system which is handed 
over to us with no information about it. What quantum 
state or density matrix will we be able to assign to it? 
To express our complete lack of information about this 
system we have to assign equal weightage to all possible 



outcomes in all bases and therefore a density matrix pro- 
portional to identity is the best choice. In this extreme 
case, measurement yields one of all possible outcomes and 
one cannot reliably infer anything from this. The den- 
sity operator merely reflects our lack of knowledge in the 
state of the system. It is only when measurements are 
performed on many copies described by the same den- 
sity operator that outcomes or more precisely, the aver- 
age outcome, carry any meaningful information. This is 
no more than standard statistical sampling and for an 
n qubit density matrix proportional to the identity, the 
variance scales as 2^", indicating that one typically needs 
0(2") samples (measurement outcomes) to make sensible 
inferences from measurements. Accordingly the ensemble 
size would have to scale as 0(2") before we can consider 
this density operator to have been realized accurately, at 
least in terms of measurement outcome averages. Here 
the ensemble begins to appear as a collection of quan- 
tum systems with states described according to the den- 
sity matrix. It should be noted that, for this version of 
the DJ algorithm, the situation is less dire since infer- 
ences are made from measurements on the target qubit 
alone. Hence n does not appear in the failure probability 
in Eqn. (|15|l . However, as clear from Eqn. (|18|) a classical 
random algorithm does the task better. 

These ensemble computing ideas might work for other 
algorithms and give a genuine speed up over classical or 
classical random algorithms. O ne p ossibility is efficient 
simulation of quantum systems 12] and our result does 
not pertain to this. It is worthwhile to explore the exact 
implications of this model which will be taken up else- 
where. 



Acknowledgments 

Numerous stimulating discussions with R. B. Griffiths 
and Kavita Dorai are acknowledged. The research effort 
is funded by the NSF Grant No. 0139974. 



[1] E. Knill, I. Chuang, and R. Laflamme, Phys. Rev. A 57, 
3348 (1997). 

[2] I. L. Chuang, N. Gershenfeld, M. G. Kubinec, and D. W. 
Leung, Proc. R. Soc. Lond. A 454, 447 (1998). 

[3] D. G. Gory, M. D. Price, and T. F. Havel, Physica D 
120, 82 (1998). 

[4] L. J. Schulman and U. Vazirani, Proc. 31st AGM Sym- 
posium on Theory of Computing p. 322 (1999). 

[5] P. O. Boykin, T. Mor, V. Roychowdhury, F. Vatan, , and 
R. Vrijen, Proc. Natl. Acad. Sci. USA 99, 3388 (2002). 

[6] F. M. Woodward and R. Briischweiler (2000), arXiv: 
|quant-ph/0006024| 



[7] J. M. Myers, A. F. Fahmy, S. J. Glaser, and R. Marx, 

Phys. Rev. A 63, 032302 (2002). 
[8] R. Cleve, A. Ekert, G. Macchiavello, and M. Mosca, Proc. 

R. Soc. Lond. A 454, 339 (1998). 
[9] D. Deutsch and R. Jozsa, Proc. R. Soc. Lond. A 439, 

553 (1992). 

[10] W. S. Warren, Science 277, 1688 (1997). 

[11] A. Brazier and M. B. Plenio (2003), arXiv: 

quant-ph/0304017 
[12] E. Knill and R. Laflamme, Phys. Rev. Lett. 81, 5672 

(1998). 



